268
17
Genomics
indeed the human gastrointestinal tract; cf. Chap. 19). It may sometimes be necessary
to clone the DNA (using cultured laboratory bacteria) in order to produce sufficient
material for further analysis.
The most basic analysis is simply to sequence all the DNA. This may result
in millions of genes, which can be compared with known sequences; early work
appeared to reveal an astonishing diversity of whole classes of hitherto unknown
genes. One should, however, be mindful of the influence of sequencing errors in
giving the appearance of more novelty than is actually the case. 30
Function-based metagenomics obviates the need to sequence the DNA by letting
the fragments be translated, again in laboratory-cultured bacteria. Novel proteins of
phenotypes are then further analysed.
Metagenomics can be used to better understand the diversity and functions of
microbial communities in a wide range of habitats, and to gain insights into the
evolutionary processes that shape the communities. It can also be used to detect and
control potentially disease-causing organisms, and explore the potential of microbial
communities for producing substances useful to mankind.
References
Allison L, Powell D, Dix TI (1999) Compression and approximate matching. Comput J 42:1–10
Bibikova M, Fan JB (2010) Genome-wide DNA methylation profiling. WIREs Syst Biol Med
2:210–223
Bibikova M et al (2006) High-throughput DNA methylation profiling using universal bead arrays.
Genome Res 16:383–393
Braslavsky I, Hebert B, Kartalov E, Quake SR (2003) Sequence information can be obtained from
single DNA molecules. Proc Natl Acad Sci USA 100:3960–3964
Brinkman AB, Simmer F, Ma K, Kaan A, Zhu J, Stunnenberg HG (2010) Whole-genome DNA
methylation profiling using MethylCap-seq. Methods 52:232–236
Committee on Metagenomics: Challenges and Functional Applications (2007) The new science of
metagenomics. National Academies Press, Washington DC
França LTC, Carrilho E, Kist TB (2002) A review of DNA sequencing techniques. Q Rev Biophys
35:169–200
Gorban AN, Popova TG, Sadovsky MG (2000) Classification of symbol sequences over their fre-
quency dictionaries. Open Syst Inform Dynam 7:1–17
Gorban AN, Popova TG, Zinovyev A (2005) Codon usage trajectories and 7-cluster structure of
143 complete bacterial genomic sequences. Physica A 353:365–387
Heyn H, Esteller M (2012) DNA methylation profiling in the clinic: applications and challenges.
Nat Rev Genet 13:679–692
IHGSC (International Human Genome Sequencing Consortium) (2001) Initial sequencing and
analysis of the human genome. Nature 409:860–921
Karlin S, Ladunga I, Blaisdell BE (1994) Heterogeneity of genomes: measures and values. Proc
Natl Acad Sci USA 91:12837–12841
Kruskal JB (1964) Multidimensional scaling by optimizing goodness of fit to a nonmetric hypoth-
esis. Psychometrika 29:1–27
30 Quince et al. (2009).